NOZAKI ET AL. Page 1 of 26
Atty Docket: HIRA-80



FIG. 1 



101 



Gene expression 
pattern data 



105 



Clustering processor 



102 




Display 



103 



Keyboard 





FIG. 2 



202 







FIG. 3

Time

Clustering-applied region






FIG. 4 



( Start )









Setting initial parameters 










Process A 

Determining display area 










Displaying 





401 



402 



403 





FIG. 5

Length S
Times: T_0, T_1, T_2, ..., T_start, ...
Gene expression data: g0, g1, g2, ...
Average expression level of each gene: G_0, G_1, G_2, ..., G_i, ..., G_m
Array for storing display information: l[0], l[1], l[2], ..., l[I], ..., l[m] (members: index, linepos)
Reference numerals: 501, 502
Clustering-applied region 201
Clustering threshold (K), ..., K_end






FIG. 6 



( Start )



601 



Reading out data of gene expression patterns (setting g[j][i]) 



602 



Setting clustering-applied region (T_start, T_end)
Setting positive values indicating reference values for
distinguishing clusters (K_start, ..., K_end)
Setting an integer indicating clustering range (S) 
Selecting means of clustering 



Error display 



NO 




608 



Setting average expression levels (G_i) for genes g_i (i = 1, 2, ..., m)

609

Initializing l[I].linepos as T_end for array l[I] (I = 0, 1, ..., m)
indicating display information of each gene

610

Initializing display information counter variable lmax as zero

t ← T_start

611

612

Generating cluster B to which {0, 1, ..., m} is registered







( End ) 
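
The initial-parameter steps of FIG. 6 can be sketched in a few lines of Python. This is an illustration only: the names `DisplayInfo` and `init_state`, and the layout in which `g[i]` holds the time series of gene i, are assumptions and do not come from the drawings.

```python
from dataclasses import dataclass


@dataclass
class DisplayInfo:
    """One element of the display-information array (members index, linepos)."""
    index: int = -1   # gene shown on this row; -1 means not assigned yet
    linepos: int = 0  # time from which the dividing line below this row starts


def init_state(g, t_end):
    """Set up the values that FIG. 6 prepares before clustering starts.

    g[i] is assumed to be the list of expression values of gene i over time.
    """
    # Average expression level G_i of each gene.
    averages = [sum(series) / len(series) for series in g]
    # Every linepos starts at T_end, i.e. no dividing line is visible yet.
    display = [DisplayInfo(linepos=t_end) for _ in g]
    # Display information counter.
    lmax = 0
    # Initial cluster B contains every gene index.
    cluster_b = list(range(len(g)))
    return averages, display, lmax, cluster_b
```

Starting every `linepos` at T_end means a row only gains a visible divider once a later step lowers its value.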






FIG. 7 






( Start )



701 



B ← the cluster (given)
t ← the time (given)



702 



Process B: Subject cluster B to clustering. 

Using t and B as arguments, the total number of 
the resulting clusters and the clustering results are 
set as cmax and A[J] (J = 1, 2, ..., cmax), respectively.




YES 



NO 



J ← 1



YES 




705 



706 



Where elements of cluster A[J] are i_1, ..., i_k,
average expression levels G_{i_1}, ..., G_{i_k} are aligned
in descending order to be rearranged as j_1, ..., j_k

707

l[lmax].index ← j_1
l[lmax+1].index ← j_2
...
l[lmax+k-1].index ← j_k

708

l[lmax+k-1].linepos ← t

709

lmax ← lmax + k

J ← J + 1



710 



711 



t ← t + 1, J ← 1

B ← A[J]



714 





Process A: determining display area again using t and B as arguments

l[lmax-1].linepos ← t

715

J ← J + 1

716

( End )






FIG. 8 



( Start )

B ← the cluster (given)
t ← the time (given)



801 



802 



Calculate dissimilarity d_ij (i < j and i, j ∈ {i_1, ..., i_k})
between genes corresponding to elements i_1, ..., i_k
of cluster B from time t to time t+S

Generate clusters C[1], ..., C[k] to which
{i_1}, ..., {i_k} are registered, respectively



803 



ccnt

804



805 



Generate cluster D. Initialize D as a null set. 



806 



d_pq ← { minimum value of d_ij (i, j ∈ {1, 2, ..., ccnt} − D) }




NO 



808 



Generate new cluster C[ccnt+1] to which a sum set
of elements contained in clusters C[p] and C[q] is registered

Delete elements contained in clusters C[p] and C[q]



809 



Register p and q to D 



810 



Calculate dissimilarity d_{h,ccnt+1} between cluster
C[h] (h ∈ {1, 2, ..., ccnt} − D)
and cluster C[ccnt+1] from time t to time t + S

ccnt ← ccnt + 1



812 






FIG. 9 



© 



813 



Let cmax be the total number of
clusters C[1] to C[ccnt] without a null set

814

Generate clusters A[1], ..., A[cmax]



815 



Calculate average expression levels of genes contained
in the clusters without a null set to obtain
G'_p = (G_{i_1} + ... + G_{i_k}) / k for clusters C[p] = {i_1, ..., i_k}.
Sequentially align the obtained values in descending order
(G'_{p_1} ≥ ... ≥ G'_{p_cmax}). Then, register C[p_1], ..., C[p_cmax]
as A[1], ..., A[cmax], respectively.

Output the total cluster number
cmax and clusters A[1], ..., A[cmax]



816 
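
Process B (FIGS. 8 and 9) reads as a pairwise-merge clustering over the window from t to t+S, followed by ordering the surviving clusters by average expression level. The Python sketch below illustrates that reading under several assumptions that the drawings do not confirm: Euclidean distance as the gene dissimilarity, average linkage between clusters, and a stop rule that ends merging once the smallest dissimilarity reaches a threshold `k_threshold`. The function names are hypothetical.

```python
def dissimilarity(a, b):
    """Euclidean distance between two equal-length windows (assumed measure)."""
    return sum((x - y) ** 2 for x, y in zip(a, b)) ** 0.5


def process_b(g, averages, members, t, s, k_threshold):
    """Cluster the genes in `members` on the window [t, t+s].

    Returns (cmax, clusters), with clusters ordered by descending mean of the
    per-gene average expression levels.
    """
    clusters = [[i] for i in members]  # start with one cluster per gene

    def linkage(c1, c2):
        # Average linkage (assumed): mean pairwise distance on the window.
        total = sum(
            dissimilarity(g[i][t:t + s + 1], g[j][t:t + s + 1])
            for i in c1 for j in c2
        )
        return total / (len(c1) * len(c2))

    while len(clusters) > 1:
        # Find the closest pair (p, q) among the current clusters.
        d, p, q = min(
            (linkage(clusters[p], clusters[q]), p, q)
            for p in range(len(clusters))
            for q in range(p + 1, len(clusters))
        )
        if d >= k_threshold:  # assumed stop rule
            break
        merged = clusters[p] + clusters[q]
        clusters = [c for n, c in enumerate(clusters) if n not in (p, q)]
        clusters.append(merged)

    def mean_average(cluster):
        return sum(averages[i] for i in cluster) / len(cluster)

    clusters.sort(key=mean_average, reverse=True)
    return len(clusters), clusters
```

The flowchart keeps emptied clusters C[p] and C[q] in place and tracks them through set D; the sketch simply drops them from the list, which gives the same non-empty clusters at the end.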





FIG. 10 



( Start )

Convert the values of expression data
g[k][x] (k = 0, 1, ..., m) of a gene indicated
by x ← l[i].index to corresponding display
colors to display as an i-th row

1003

Draw a dividing line between the clusters
immediately below the just displayed row
from time l[i].linepos to T_end




1004 



i ← i + 1



( End ) 
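
The display loop of FIG. 10 can be sketched as follows. The grey-level colour scale, the dictionary layout of each display entry, and the names `to_color` and `render_rows` are assumptions made for illustration; the drawings only state that values are converted to display colors and that a dividing line runs from `linepos` to T_end.

```python
def to_color(value, vmax):
    """Map an expression value to a grey level 0..255 (assumed colour scale)."""
    if vmax <= 0:
        return 0
    clipped = max(0.0, min(value, vmax))
    return round(255 * clipped / vmax)


def render_rows(g, display, t_end):
    """Build one (gene, colours, divider) tuple per display row.

    g[x] is assumed to be the time series of gene x, and each display entry a
    dict with keys "index" and "linepos". The divider is the time span of the
    line drawn below the row; a span of zero length means no visible line.
    """
    vmax = max(max(series) for series in g)
    rows = []
    for info in display:
        x = info["index"]
        colors = [to_color(v, vmax) for v in g[x]]
        rows.append((x, colors, (info["linepos"], t_end)))
    return rows
```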






FIG. 11 







FIG. 12 




1103 



201: Clustering-applied region






FIG. 13 



Exemplary display of the results of clustering analysis 







FIG. 14 



Example of gene expression pattern data

1401

Gene ID (geneID): 1, 2, ..., m
Experiment cases: 1, 2, 3, 4, 5, 6, ..., n
Values:
0 1 2 0 2 0
1 2 0 0 2 2 1
0 4 3 6 5 4 0

1402

m number of vector data






FIG. 15 



Example of cluster structures

cluster (1501)
Members     Value
type        leaf
left
right
distance
geneID      17
level       0

cluster (1502)
Members     Value
type        node
left        -> cluster L
right       -> cluster R
distance    91
geneID
level       5
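
The two cluster records of FIG. 15 map naturally onto a small tree type. The Python sketch below keeps the member names from the figure (`type`, `left`, `right`, `distance`, `geneID`, `level`); the class name `Cluster`, the helper `leaves`, and the second leaf with gene ID 42 in the test are hypothetical.

```python
from dataclasses import dataclass
from typing import Optional


@dataclass
class Cluster:
    """A node of the cluster tree: either a leaf (one gene) or an inner node."""
    type: str                           # "leaf" or "node"
    left: Optional["Cluster"] = None    # child cluster L (inner nodes only)
    right: Optional["Cluster"] = None   # child cluster R (inner nodes only)
    distance: Optional[float] = None    # dissimilarity at which L and R merged
    geneID: Optional[int] = None        # gene identifier (leaves only)
    level: int = 0                      # cluster level; 0 for leaves in FIG. 15


def leaves(cluster):
    """Return the gene IDs under a cluster in left-to-right order."""
    if cluster.type == "leaf":
        return [cluster.geneID]
    return leaves(cluster.left) + leaves(cluster.right)
```

A left-to-right walk over the leaves gives one possible row order for the display.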






FIG. 16 







FIG. 17 



Exemplary display data

1702

intervals[]
Indexed by level: 1 2 3 4 5 6 ...

1701

Interval structure
Members     Value
num_clust   517
min_dist    11
max_dist    28
borders     -> {3, 16, 31, 56, 127, ...}

1703

disp_leaf_clusters[]
Index:      1 2 3 4 5 6 ... nclus
Gene ID:    51 16 33 ...
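
The interval structure of FIG. 17 could be represented as below. The member names follow the figure, but the interpretation is an assumption: `borders` is taken to be a sorted list of row positions in `disp_leaf_clusters[]` at which a new cluster begins, so that a binary search yields the cluster of any row. The helper `cluster_of_row` and the field values in the test are hypothetical.

```python
import bisect
from dataclasses import dataclass, field
from typing import List


@dataclass
class Interval:
    """Display data for one level (members as named in FIG. 17)."""
    num_clust: int                  # number of clusters at this level
    min_dist: float                 # smallest merge distance in this interval
    max_dist: float                 # largest merge distance in this interval
    borders: List[int] = field(default_factory=list)  # sorted border rows


def cluster_of_row(interval, row):
    """Return the 0-based cluster number of a display row.

    Assumes a border value b means that row b is the first row of the next
    cluster; the drawings do not state this convention.
    """
    return bisect.bisect_right(interval.borders, row)
```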






FIG. 18 



Flowchart of general process 
( START )



1800 



Reading out gene expression pattern data 



Setting various parameters 
for clustering analysis 



Clustering analysis 



1801 



1802 



Display the results of the analysis 



1803 



( END )






FIG. 19
Detailed flowchart of clustering analysis (2: Setting cluster levels)
Detailed flowchart of clustering analysis (3: Generating display data)